Benchmarking Encrypted Data Storage in HBase and Cassandra with YCSB
نویسندگان
چکیده
Using cloud storage servers to manage large amounts of data has gained increased interest due to their advantages (like availability and scalability). A major disadvantage of cloud storage providers, however, is their lack of security features. In this article we analyze a cloud storage setting where confidentiality of outsourced data is maintained by letting the client encrypt all data records before sending them to the cloud storage. Our main focus is on benchmarking and quantifying the performance loss that such a cloud storage system incurs due to encrypted storage. We present results based on a modification of the Yahoo! Cloud Serving Benchmark using the AES implementation of the Bouncy Castle Java Cryptography Provider for the encryption and decryption steps. The results show that for single read and write operations the performance loss is acceptable (even for stronger encryption with 256 bit keylength) while for range scans the impact can be quite severe.
منابع مشابه
Benchmarking Replication and Consistency Strategies in Cloud Serving Databases: HBase and Cassandra
Databases serving OLTP operations generated by cloud applications have been widely researched and deployed nowadays. Such cloud serving databases like BigTable, HBase, Cassandra, Azure and many others are designed to handle a large number of concurrent requests performed on the cloud end. Such systems can elastically scale out to thousands of commodity hardware by using a shared nothing distrib...
متن کاملBenchmarking Scalability and Elasticity of Distributed Database Systems
Distributed database system performance benchmarks are an important source of information for decision makers who must select the right technology for their data management problems. Since important decisions rely on trustworthy experimental data, it is necessary to reproduce experiments and verify the results. We reproduce performance and scalability benchmarking experiments of HBase and Cassa...
متن کاملComparison of Database and Workload Types Performance in Cloud Environments
The rapid growth of unstructured data over the last few years, has led to the emergence of new database management systems. Traditional relational databases, despite their wide adoption and plethora of features, begin to show weaknesses when having to deal with very large amounts of data. Numerous types of databases have emerged in the Cloud domain, in order to exploit the elasticity of Cloud e...
متن کاملPerformance Analysis Of Scalable Sql And Nosql Databases : A Quantitative Approach
PERFORMANCE ANALYSIS OF SCALABLE SQL AND NOSQL DATABASES: AQUANTITATIVE APPROACHby HARISH BALASUBRAMANIANMay 2014Advisor: Dr.Weisong ShiMajor: Computer ScienceDegree: Master of Science Benchmarking is a common method in evaluating and choosing a NoSQL database.There are already lots of benchmarking reports available in internet and research papers. Most ofthe ben...
متن کاملLocality-Aware GC Optimisations for Big Data Workloads
Many Big Data analytics and IoT scenarios rely on fast and non-relational storage (NoSQL) to help processing massive amounts of data. In addition, managed runtimes (e.g. JVM) are now widely used to support the execution of these NoSQL storage solutions, particularly when dealing with Big Data key-value store-driven applications. The benefits of such runtimes can however be limited by automatic ...
متن کامل